In vivo CRISPR screens reveal SCAF1 and USP15 as drivers of pancreatic cancer

Functionally characterizing the genetic alterations that drive pancreatic cancer is a prerequisite for precision medicine. Here, we perform somatic CRISPR/Cas9 mutagenesis screens to assess the transforming potential of 125 recurrently mutated pancreatic cancer genes, which revealed USP15 and SCAF1 as pancreatic tumor suppressors. Mechanistically, we find that USP15 functions in a haploinsufficient manner and that loss of USP15 or SCAF1 leads to reduced inflammatory TNFα, TGF-β and IL6 responses and increased sensitivity to PARP inhibition and Gemcitabine. Furthermore, we find that loss of SCAF1 leads to the formation of a truncated, inactive USP15 isoform at the expense of full-length USP15, functionally coupling SCAF1 and USP15. Notably, USP15 and SCAF1 alterations are observed in 31% of pancreatic cancer patients. Our results highlight the utility of in vivo CRISPR screens to integrate human cancer genomics and mouse modeling for the discovery of cancer driver genes with potential prognostic and therapeutic implications.

� D A description of all covariates tested � D A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons � D A full description of the statistical parameters including central tendency (e.g.means) or other basic estimates (e.g.regression coefficient) AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g.confidence intervals) l'vl D For null hypothesis testing, the test statistic (e.g.F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted � Give P values as exact values whenever suitable.
� D For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings � D For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes � D Estimates of effect sizes (e.g.Cohen's d, Pearson's r), indicating how they were calculated Our web collection on statistics for biologists contains articles on many of the points above.

Software and code
Policy information about availability of computer code

Data collection
Provide a description of all commercial, apen source and custom code used ta collect the data in this study, specifying the version used OR state that no software was used.

Data analysis
Provide a description of all commercial, open source and custom code used to analyse the data in this study, specifying the version used OR state that no software was used.
For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors and reviewers.We strongly encourage code deposition in a community repository (e.g.GitHub).See the Nature Portfolio guidelines for submitting code & software for further information.

Data
Policy information about availability of data All manuscripts must include a data availability statement.This statement should provide the following information, where applicable: -Accession codes, unique identifiers, or web links for publicly available datasets -A description of any restrictions on data availability -For clinical datasets or third party data, please ensure that the statement adheres to our J2Q!kY The RNA-seq data generated in this study have been deposited in the NCBI Gene Expression Omnibus under GEO accession GSE220556.All remaining data can be found in the Article, Supplementary and Source Data files.

Recruitment Ethics oversight
Describe how participants were recruited.Outline any potential self-selection bias or other biases that may be present and how these are likely to impact results.
Identify the organization(s) that approved the study protocol.
Note that full information on the approval of the study protocol must also be provided in the manuscript.

Field-specific reporting
Please select the one below that is the best fit for your research.If you are not sure, read the appropriate sections before making your selection.

D Behavioural & social sciences D Ecological, evolutionary & environmental sciences
For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf

Life sciences study design
All studies must disclose on these points even when the disclosure is negative.

Sample size
No statistical methods were used to pre-determine sample sizes.Sample size for in vivo screening studies were empirically determined using barcode library recovery studies such that adequate coverage (>200 fold) of screening library could be achieved.Sample sizes for other in vivo experiments such as target validation, with or without use of the checkpoint inhibitors, was determined based on prior knowledge of typical biological variability with the tumor models and immunotherapy treatments.

Data exclusions
No data were excluded from the study.

Replication
Results presented in the manuscript were replicated at least twice in independent experiments or in two complementary tumor models.
Results between replicates were reproducible.

Randomization
Allocation of experimental animals to the treatment groups were randomized to treatment condition(s) by cage following tumor onset.

I
.I

I .I
Covariant are not relevant to this study as isogenic mice with clean genetic background were used.In vitro experiments were performed using identical cellular preparations and were randomly assigned to the different experimental conditions.

Blinding
All tumor measurements conducted in this study were blinded to group allocation.All bioinformatics studies were automated and can be considered as blinded.The researchers conducting TCGA analysis of human PDAC samples and QRT analysis were blinded to group allocation.For experiments other than mentioned here, investigators were not blinded to treatment groups or genotypes, as knowledge of this information was essential to conduct the studies. .I

involving human participants, their data, or biological material
Policy information about studies with human participants or human data.See also policy information about sex.gender (identity/presentation). and sexual orientation and race.ethnicity and racism.Indicate if findings apply ta only one sex or gender; describe whether sex and gender were considered in study design; whether sex and/or gender was determined based on self-reporting or assigned and methods used.Provide in the source data disaggregated sex and gender data, where this information has been collected, and if consent has been obtained for sharing of individual-level data; provide overall numbers in this Reporting Summary.Please state if this information has not been collected.Report sex-and gender-based analyses where performed, justify reasons for lack of sex-and gender-based analysis.
I I Research Population characteristics Describe the covariate-relevant population characteristics of the human research participants (e.g.age, genotypic information, past and current diagnosis and treatment categories).ff you filled out the behavioural & social sciences study design questions and have nothing to add here, write "See above."